Evaluation of Computational Linguistic Techniques for Identifying Significant Topics for Browsing Applications

نویسندگان

  • Judith L. Klavans
  • Nina Wacholder
  • David Kirk Evans
چکیده

Evaluation of natural language processing tools and systems must focus on two complementary aspects: first, evaluation of the accuracy of the output, and second, evaluation of the functionality of the output as embedded in an application. This paper presents evaluations of two aspects of LinkIT, a tool for noun phrase identification linking, sorting and filtering. LinkIT [Evans 1998] uses a head sorting method [Wacholder 1998] to organize and rank simplex noun phrases (SNPs). LinkIT is to identify significant topics in domainindependent documents. The first evaluation, reported in D.K.Evans et al. 2000 compares the output of the Noun Phrase finder in LinkIT to two other systems. Issues of establishing a gold standard and criteria for matching are discussed. The second evaluation directly concerns the construction of the browsing application. We present results from Wacholder et al. 2000 on a qualitative evaluation which compares three shallow processing methods for extracting

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

New Applications on Linguistic Mathematical Structures and Stability Analysis of Linguistic Fuzzy Models

In this paper some algebraic structures for linguistic fuzzy models are defined for the first time. By definition linguistic fuzzy norm, stability of these systems can be considered. Two methods (normed-based & graphical-based) for stability analysis of linguist fuzzy systems will be presented. At the follow a new simple method for linguistic fuzzy numbers calculations is defined. At the end tw...

متن کامل

Mining Fuzzy Weighted Browsing Patterns from Time Duration and with Linguistic Thresholds

World-wide-web applications have grown very rapidly and have made a significant impact on computer systems. Among them, web browsing for useful information may be most commonly seen. Due to its tremendous amounts of use, efficient and effective web retrieval has become a very important research topic in this field. Techniques of web mining have thus been requested and developed to achieve this ...

متن کامل

NAACL HLT 2010 Workshop on Extracting and Using Constructions in Computational Linguistics

ii Introduction A construction can be defined as a form-meaning pairing in which the components cannot entirely explain the meaning of the whole. Constructional phenomena range from morphemes to argument structure, and include obvious examples like collocations (" hermetically sealed "), (idiomatic) expressions with fixed constituents (" kick the bucket "), expressions with (semi-)optional cons...

متن کامل

A Latent Variable Model for Geographic Lexical Variation

The rapid growth of geotagged social media raises new computational possibilities for investigating geographic linguistic variation. In this paper, we present a multi-level generative model that reasons jointly about latent topics and geographical regions. High-level topics such as “sports” or “entertainment” are rendered differently in each geographic region, revealing topic-specific regional ...

متن کامل

Discovery of Fuzzy Multiple-Level Web Browsing Patterns

Web usage mining is the application of data mining techniques to discover usage patterns from web data. It can be used to better understand web usage and better serve the needs of rapidly growing web-based applications. Discovery of browsing patterns, page clusters, user clusters, association rules and usage statistics are some usage patterns in the web domain. Web mining of browsing patterns i...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000